
Input file Fbh21620fl.seq; Output File 21620.trans
Sequence length 1909

TACTTAGACTCAGCCGGCTTTTCCA<XX^^ 

tgcgccgttacagatcx::aagctctgaaa^ 

CCGGATC(X!GTGATCCAAATCTAAGAAC1^^ 
TTCTGCTCTAAAAGCTGCGGAATT^ 

CGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGC^ ' 

M A R P G M E R W R D R L 13
CGAGX3CGGACGGGCGGCCnX3GGCCC ATG GCC AGG CCC GGC ATG GAG CGG TGG CGC GAC CGG CTC 39

^ ^ ^ T G A S G G I G A A V A ' 'r A ^/ Q * 33 

GCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCC GTG GGC^CGGGCC CTG GTc' GAG " 99 
EAVIYVLSTPAHIQIGDIQM 253
GAG GCT GTT ATC TAC GTC CTC AGC ACT CCC GCA CAT ATC CAG ATT GGA GAC ATC CAG ATG 759

R P T E Q V T * 261
AGG CCC ACQ GAG CAG GTG ACC TAG 783

TGACTGTGGGAGCTCCTCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGG^ 

ATCACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTCAGATT'^ 

ATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCCTAAT^ 
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CCATCTICTTCCACCTCAACGTC^^ 

ATCCAGGGCTCCAGACTTCCTCCTC^^ 

CCCAGTCTTCGCTTCI^^ 

GGCCCAC?ro3ATTTCATGGTGATCATT^^ 

TAGTYTAGAGAAAAAACCTCCCACACCTCC^ 
Analysis of 21620 (260 aa)
Signal Peptide Predictions for 21620



Method | Score | Mat@
.11



Note: amino-terminal 70aa used for signal peptide prediction



Transmembrane Segments Predicted by MEMSAT 
Prosite Pattern Matches for 21620



>PS00005|PDOC00005|PKC_PHOSPHO_SITE Protein kinase C phosphorylation site.
Query: 135 SMK 137

>PS00006|PDOC00006|CK2_PHOSPHO_SITE Casein kinase II phosphorylation site.
Query: 12 SNEE 7S
Query: 89 SCVD 92
Query: 135 SMKE 138

>PS00008|PDOC00008|MYRISTYL N-myristoylation site.
Query: 23
Query: 24 GCARTV 45
Query: 90 GVDICI 95
Query: 109 GSTSGW 114
Query: 199 GWETQ 204

>PS00061|PDOC00060|ADH_SHORT Short-chain alcohol dehydrogenase family signature.
Query: 166 YSATKYAVTAL 176
Input file Fbh33756.seq; Output File 33756.trans
Sequence length 1153 

CCGCGCCCCX3CCCTCGC7«XXX:AN^^ 

<XXXX;CTCTTCCTCGGAGCTACCCAGGCGQC^ 

CCCGGCCCGQCATCAGCCGCTACCTGCTGra 

MEKCEAAAKDIRG 13 

GGCAACATCATCCTQQCCTGCCGAGAC ATG GAG AAG TGT GAG GCG GCA GCA AAG GAG ATC CGC GGG 39 

ETLNHHVNARHLDLASLKSI 33 

GAG ACC CTC AAT CAC CAT GTC AAC GCC CGG CAC CTG GAG TTG GCT TCC CTC AAG TCT ATC 99 

REFAAKIIEEEERVDILINN 53 

CGA GAG TTT GCA GCA AAG ATC ATT GAA GAG GAG GAG CGA GTG GAC ATT CTA ATC AAC AAC 159 

AGVMRCPHWTTEDGFEMQFG 73 

GCG GGT GTG ATG CGG TGC CCC CAC TGG ACC ACC GAG GAC GGC TTC GAG ATG CAG TTT GGC 219 

VNHLGHFLLTNLLLDKLKAS 93 

GTT AAC CAC CTG GGT CAC TTT CTC TTG ACA AAC TTG CTG CTG GAC AAG CTG AAA GCC TCA 279 

APSRIINLSSLAHVAGHIDF 113

GCC CCT TCG CGG ATC ATC AAC CTC TCG TCC CTG GCC CAT GTT GCT GGG CAC ATA GAC TTT 339

DDLNWQTRKYNTKAAYCQSK 133

GAC GAC TTG AAC TGG CAG ACG AGG AAG TAT AAC ACC AAA GCC GCC TAC TGC CAG AGC AAG 399

LAIVLFTKELSRRLQGSGVT 153

CTC GCC ATC GTC CTC TTC ACC AAG GAG TTG AGC CGG CGG CTG CAA GGC TCT GGT GTG ACT 459

VNALHPGVARTELGRHTGIH 173

GTC AAC GCC CTG CAC CCC GGC GTG GCC AGG ACA GAG CTG GGC AGA CAC ACG GGC ATC CAT 519

GSTFSSTTLGPIFWLLVKSP 193

GGC TCC ACC TTC TCC AGC ACC ACA CTC GGG CCC ATC TTC TGG CTG CTG GTC AAG AGC CCC 579

ELAAQPSTYLAVAEELADVS 213

GAG CTG GCC GCC CAG CCC AGC ACA TAC CTG GCC GTG GCG GAG GAA CTG GCG GAT GTT TCC 639

GKYFDGLKQKAPAPEAEDEE 233 

GGA AAG TAC TTC GAT GGA CTC AAA CAG AAG GCC CCG GCC CCC GAG GCT GAG GAT GAG GAG 699 

VARRLWAESARLVGLEAPSV 253 

GTG GCC CGG AGG CTT TGG GCT GAA ACT GCC CGC CTG GTG GGC TTA GAG GCT CCC TCT GTG 759 

REQPLPR* 261 

AGG GAG CAG CCC CTC CCC AGA TAA 783 
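
The paired rows above (one-letter residues over codon triplets) can be cross-checked mechanically. The following is a minimal sketch, assuming the standard genetic code and whitespace-separated codons as printed in this listing; the helper name `translate` is hypothetical and is not part of the program that produced the .trans output.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(codons: str) -> str:
    """Translate whitespace-separated codons; unreadable codons become 'X'."""
    return "".join(CODON_TABLE.get(c.upper(), "X") for c in codons.split())


# Final codon row of the 33756 translation, as printed above.
print(translate("AGG GAG CAG CCC CTC CCC AGA TAA"))
```

Codons damaged by scanning (anything other than three A/C/G/T letters) come back as `X`, which makes mismatches between a residue row and its codon row easy to spot.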

CCTCTGGAGCAGATTTGAAAGCCAGGATGGCGCCTCCAGACCGAGGACAGCTCTCCGCCATGCCCG^ 
CTACCTGAGCCGGGAGACCCAGGACTG 
Analysis of 33756 (260 aa)

[Plot omitted; horizontal axis ticks at 1, 41, 81, 121, 161, 201, 241]



Signal Peptide Predictions for 33756 



Method | Predict | Score | Mat@
SignalP (eukaryote) | NO | |

Note: amino-terminal 70aa used for signal peptide prediction



No TM domains predicted by MEMSAT for 33756 
Prosite Pattern Matches for 33756

>PS00001|PDOC00001|ASN_GLYCOSYLATION N-glycosylation site.
Query: 100 NLSS 103
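
Each "Query: start MATCH end" line reports where a short sequence pattern matches the translated protein, with 1-based residue positions. A minimal sketch of such a scan follows. The pattern strings `N-{P}-[ST]-{P}` and `[ST]-x(2)-[DE]` are quoted from memory of the PROSITE entries for N-glycosylation and casein kinase II sites and should be treated as assumptions. The helpers `prosite_to_regex` and `scan` are hypothetical names and handle only the simple syntax elements shown.

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Convert a simple PROSITE-style pattern, e.g. 'N-{P}-[ST]-{P}', to a regex."""
    parts = []
    for element in pattern.rstrip(".").split("-"):
        # Split an element such as 'x(2)' into its core 'x' and repeat count '2'.
        m = re.fullmatch(r"(.+?)(?:\((\d+)\))?", element)
        core, count = m.group(1), m.group(2)
        if core == "x":                    # any residue
            rx = "."
        elif core.startswith("{"):         # excluded residues
            rx = "[^" + core[1:-1] + "]"
        else:                              # literal residue or [allowed set]
            rx = core
        parts.append(rx + ("{" + count + "}" if count else ""))
    return "".join(parts)


def scan(seq: str, pattern: str, first_pos: int = 1):
    """Return (start, match, end) tuples, 1-based, including overlapping hits."""
    rx = re.compile("(?=(" + prosite_to_regex(pattern) + "))")
    return [(m.start() + first_pos, m.group(1),
             m.start() + first_pos + len(m.group(1)) - 1)
            for m in rx.finditer(seq)]


# Residues 94-113 of the 33756 translation, taken from the row numbered 113.
print(scan("APSRIINLSSLAHVAGHIDF", "N-{P}-[ST]-{P}", first_pos=94))
```

The lookahead wrapper is a design choice: it lets overlapping hits be reported, which matters for short patterns such as the three-residue kinase sites.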

>PS00005|PDOC00005|PKC_PHOSPHO_SITE Protein kinase C phosphorylation site.
Query: 29 SLK 31
Query: 32 SIR 34
Query: 120 TRK 122
Query: 144 SRR 146
Query: 213 SGK 215
Query: 242 SAR 244
Query: 252 SVR 254



>PS00006|PDOC00006|CK2_PHOSPHO_SITE Casein kinase II phosphorylation site.
Query: 32 SIRE 35
Query: 63 TTED 66
Query: 252 SVRE
>PS00008|PDOC00008|MYRISTYL N-myristoylation site.
Query:
Query:
Query:
Input file Fbh21676.seq; Output File 21676.trans
Sequence length 1699 

GC^mnX3GGTCCCTTCTT^IAAATTGGG^^^ 

TGTCGGGGGCTCCTTGAAGCCTACTAGATCANCG^ 

C^m3GCTTTGCTTCAAACTTCTAAGGT^^ 

CCAGAAAGTTAACTGGTAAGTTTAGlxnTTTT^^ 

CTGCTCCTCAGTGGATGTTGCCTTTACT^ 

TACGACTCACTATAGGGAGTCGACCCACGCXSTCCGCGGACGCGT^^ 

M S R Y 4

GTGTGCAGCAAGCTCCGCGCCGACTCCGGACGCCTGACGC'CTG^ ATG AGC CGC TAC 12

LLPLSALGTVAGAAVLLKDY 24 
CTG CTG CCG CTG TCG GCG CTG GGC ACG GTA GCA GGC GCC GCC GTC CTC CTC AAG GAG TAT 72 

VTGGACPSKATIPGKTVIVT 44 
GTC AGO GGT GGG GCT TGC GCC AGC AAG GCC ACC ATC CCT GGG AAG ACG GTC ATC GTG ACG 132 

GANTGIGKQTALELARRGGN 64 
GGC GCC AAG ACA GGC ATC GGG AAG CAG ACC GCC TTG GAA CTG GCC AGG AGA GGA GGC AAC 192 

IILACRDMEKCEAAAKDIRG 84
ATC ATC CTG GCC TGC CGA GAC ATG GAG AAG TGT GAG GCG GCA GCA AAG GAC ATC CGC GGG 252

ETLNHHVNARHLDLASLKSI 104
GAG ACC CTC AAT CAC CAT GTC AAC GCC CGG GAC CTG GAC TTG GCT TGC CTC AAG TCT ATC 312

REFAAKIIEEEERVDILINN 124
CGA GAG TTT GCA GCA AAG ATC ATT GAA GAG GAG GAG CGA GTG GAC ATT CTA ATC AAC AAC 372 

AGVMRCPHWTTEDGFEMQFG 144 
GCG GGT GTG ATG CGG TGC CGC CAC TOG ACC ACC GAG GAC GGC TTC GAG ATG CAG TTT GGC 432 

VNHLGHFLLTNLLLDKLKAS164 
GTT AAC CAC CTG GGT CAC TTT CTC TTG ACA AAC TTG CTC CTG GAC AAG CTG AAA GCC TCA 492 

APSRIINLSSLAHVAGHIDF184 
GCC CCT TCG CGG ATC ATC AAC CTC TCG TCC CTC GCC CAT GTT GCT GGG CAC ATA GAC TTT 552 

DDLNWQTRKYNTKAAYCQSK204 
GAC GAC TTC AAC TGG CAG ACG AGG AAG TAT AAC ACC AAA GCC GCC TAG TCC CAG AGC AAG 612 

LAIVLFTKELSRRLQGSGVT 224
CTC GCC ATC GTC CTC TTC ACC AAG GAG CTC AGC CGG CGG CTC CAA GGC TCT GGT GTC ACT 672

VNALHPGVARTELGRHTGIH 244
GTC AAC GCC CTC CAC CCC GGC GTC GCC AGG ACA GAG CTC GGC AGA CAC ACG GGC ATC CAT 732

GSTFSSTTLGPIFWLLVKSP264 
GGC TCC ACC TTC TCC AGC ACC ACA CTC GGG CCC ATC TTC TCG CTC CTC GTC AAG AGC CCC 792 

ELVAQPSTYLAVAEELADVS284 
GAG CTC GTC GCC CAG CCC AGC ACA TAC CTC GCC GTC GCG GAG GAA CTC GCG GAT GTT TCC 852 

GKYFDGLKQKAPAPEAEDEE 304
GGA AAG TAC TTC GAT GGA CTC AAA GAG AAG GCC CCG GCC CCC GAG GCT GAG GAT GAG GAG 912

VARRLWAESARLVGLEAPSV 324 
GTG GCC CGG AGG CTT TGG GCT GAA AGT GCC CGC CTG GTG GGC TTA GAG GCT CCC TCT GTC 972 

R E Q P L P R * 332 
AGG GAG CAG CCC CTC CCC AGA TAA 996 

CCTCTGGAGCAGATTTGAAAGCCAGGATGGCGCCTCCAGACCGAGGACAGCTG^ 

CTACCTGAGCCGGGAGACCCAGGACTGGCGGCCGCTAGACTAGTCTAGAGAAAAAACCTCC 
TGAAACAT 
Signal Peptide Predictions for 21676

Method | Predict | Score | Mat@
SignalP (eukaryote) | YES | | 17

Note: amino-terminal 70aa used for signal peptide prediction



Transmembrane Segments Predicted by MEMSAT 



Start | End | Orient | Score

Transmembrane segments for presumed mature peptide

Prosite Pattern Matches for 21676



>PS00001|PDOC00001|ASN_GLYCOSYLATION N-glycosylation site.
Query: 171 NLSS 174

>PS00005|PDOC00005|PKC_PHOSPHO_SITE Protein kinase C phosphorylation site.
Query: 100 SLK 102
Query: 103 SIR 105
Query: 191 TRK 193
Query: 215 SRR 217
Query: 284 SGK 286
Query: 313 SAR 315
Query: 323 SVR 325



>PS00006|PDOC00006|CK2_PHOSPHO_SITE Casein kinase II phosphorylation site.
Query: 54 TALE 57
Query: 103 SIRE 106
Query: 134 TTED 137
Query: 323 SVRE 326



>PS00008|PDOC00008|MYRISTYL N-myristoylation site.
Query: 12 GTVAGA 17
Query: 28 GACPSK 33
Query: 45 GANTGI 50
Query: 220 GSGVTV 225
Query: 231 GVARTE 236
Query: 242 GIHGST 247
Input file Fbh21612fll.seq; Output File 21612.trans
Sequence length 2535 

AGGCAGAAGTATGCAAAGCATGCATCTCAAATTAGTCAGCAAACCATAGTCC^ 

CTAACTCCGNCCCAGTTCCGGCCCATTCTCCG^ 

TCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTT^^ 

GGGCTCGCATCTCTCCTTCACGCGCCCGCCGCCCTAC^ 

TCCCGCCTGTGGTGCCTCCTGAACTGCGTC^ 

GCGCTCCCTTGGAGCCTACCTAGACTCAGCCGGCTCTCCAC^ 

TTTCAGTTTTCTGTTCTGCGCCGTTACAGATCCAAGCTC 

TCTTTTATTTCAGGTCCCGGATCCGGTGGTGGTGCA^ 

GCCTGTACGGAAGTGTTACTTCTGCTCTAAAAGCTGCGGAAT^ 

MLPNTGR 7 

CCGCTCGCCGCCGCCGCTGTCGCCGCCACCTCCTCTGATCTACGAAAGTC ATC TTA CCC AAC ACC GGG AGG 21 

LAGCTVFITGASRGIGKAIA 27

CTG GCA GGA TGT ACA GTT TTT ATC ACA GGT GCA AGC CGT GGC ATT GGC AAA GCT ATT GCA 81

LKAAKDGANIVIAAKTAQPH 47

TTG AAA GCA GCA AAG GAT GGA GCA AAT ATT GTT ATT GCT GCA AAG ACC GCC CAG CCA CAT 141

PKLLGTIYTAAEEIEAVGGK 67

CCA AAA CTT CTA GGC ACA ATC TAT ACT GCT GCT GAA GAA ATT GAA GCA GTT GGA GGA AAG 201

ALPCIVDVRDEQQISAAVEK 87

GCC TTG CCA TGT ATT GTT GAT GTG AGA GAT GAA CAG CAG ATC AGT GCT GCA GTG GAG AAA 261 

AIKKFGGIDILVNNASAISL 107 

GCC ATC AAG AAA TTT GGA GGA ATT GAT ATT CTG GTA AAT AAT GCC AGT GCC ATT AGT TTG 321 

TNTLDTPTKRLDLMMNVNTR 127

ACC AAT ACA TTG GAC ACA CCT ACC AAG AGA TTG GAT CTG ATG ATG AAC GTG AAC ACC AGA 381 

GTYLASKACIPYLKKSKVAH 147 

GGC ACC TAC CTT GCA TCT AAA GCA TGT ATT CCT TAT TTG AAA AAG AGC AAA GTT GCT CAT 441 

ILNISPPLNLNPVWFKQHCA 167 

ATC CTC AAT ATC AGT CCA CCA CTG AAC CTA AAT CCA GTT TGG TTC AAA CAG CAC TCT GCT 501 

YTIAKYGMSMYVLGMAEEFK 187 

TAT ACC ATT GCT AAG TAT GGT ATG TCT ATC TAT GTC CTT GGA ATC GCA GAA GAA TTT AAA 561 

GEIAVNALWPKTAIHTAAMD 207 

GGT GAA ATT GCA GTC AAT GCA TTA TGG CCT AAA ACA GCC ATA CAC ACT GCT GCT ATC GAT 621 

J^LGGPGIESQCRKVDI lADA 227 

ATC CTC GGA GGA CCT GGT ATC GAA AGC CAG TCT AGA AAA GTT GAT ATC ATT GCA GAT GCA 681 

AYSIFQKPKSFTGNFVIDEN 247 

GCA TAT TCC ATT TTC CAA AAG CCA AAA AGT TTT ACT GGC AAC TTT GTC ATT GAT GAA AAT 741 



KEEGIENFDVYAIK H P 267

ATC TTA AAA GAA GAA GGA ATA GAA AAT TTT GAC GTT TAT GCA ATT AAA CCA GGT CAT CCT 801

FIGURE 15



LQPDFFLDEYPEAVSKKVES 287 

TTG CAA CCA GAT TTC TTC TTA GAT GAA TAC CCA GAA GCA GTT AGC AAG AAA GTC GAA TCA 861 

TGAVPEFKEEKLQLQPKPRS 307

ACT GGT GCT GTT CCA GAA TTC AAA GAA GAG AAA CTC CAG CTC CAA CCA AAA CCA CGT TCT 921 

GAVEETFRIVKDSLSDDVVK 327 

GGA GCT GTG GAA GAA ACA TTT AGA ATT GTT AAG GAC TCT CTC AGT GAT GAT GTT GTT AAA 981 

ATQAIYLFELSGEDGGTWFL 347 

GCC ACT CAA GCA ATC TAT CTG TTT GAA CTC TCC GGT GAA GAT GGT GGC ACG TCG TTT CTT 1041 

DLKSKGGNVGYGEPSDQADV 367 

GAT CTG AAA AGC AAG GGT GGG AAT GTC GGA TAT GGA GAG CCT TCT GAT CAG GCA GAT GTG 1101 

VMSMTTDDFVKMFSGKLKPT 387 

GTG ATG AGT ATG ACT ACT GAT GAC TTT GTA AAA ATG TTT TCA GGG AAA CTA AAA CCA ACA 1161 

MAFMSGKLKIKGNMALAIKL 407 

ATG GCA TTC ATC TCA GGG AAA TTC AAG ATT AAA GGT AAC ATC GCC CTA GCA ATC AAA TTC 1221 

EKLMNQMNARL* 419

GAG AAG CTA ATC AAT CAG ATC AAT GCC AGA CTC TCA 1257 

AGGAAAATATAAAAAAAAAGTCGACTGCTATGCTCAAAAAGTAAAAAAAGCTC^CAGTTAAAATCTAATGTT^ 
CTTTCCTCTTATATTATAAGGATATGCACGTTTCTTCTGGAAAAGATAGAATTTCTCTCTAAAAGACTTC

TTAAAATGGCAAGCTAATCAAACATAAGCTTCATTAAGTGGGATTCTAAGACAGTCTCTCTTTTTATATTTC^

TAACCCTTTGAGCCTTACATCTCATTCACTCTCTTrcTCCAAGAAAAGTATTTTGGGCGGACAGTCAGATC

AAATTAGCTCTTTCAAATCTTCTTCTCATCTAAAATCAAGCTAGTCTCTTTTAAAATTTTTAGTTTTGGATTCT^^
AATCAAAATCTTAATGATCTTTTKRWTTTTTATATAC YTAWTTTWAARRAAAW^

TWTTAAAAAWKRRWWV\naftra^YTSKGSGmRASWMWAWRWRAm
Analysis of 21612 (418 aa)
Signal Peptide Predictions for 21612

Method | Predict | Score | Mat@
SignalP (eukaryote) | NO | |

Note: amino-terminal 70aa used for signal peptide prediction

TM domains predicted by MEMSAT for 21612
Prosite Pattern Matches for 21612



>PS00001|PDOC00001|ASN_GLYCOSYLATION N-glycosylation site.
Query: 101 NASA 104

>PS00005|PDOC00005|PKC_PHOSPHO_SITE Protein kinase C phosphorylation site.
Query: 5 TGR 7
Query: 115 TKR 117
Query: 282 SKK 284
Query: 313 TFR 315
Query: 381 SGK 383
Query: 392 SGK 394



>PS00006|PDOC00006|CK2_PHOSPHO_SITE Casein kinase II phosphorylation site.
Query: 56 TAAE 59
Query: 320 SLSD 323
Query: 338 SGED 341
Query: 372 TTDD 375



>PS00008|PDOC00008|MYRISTYL N-myristoylation site.
Query: 17 GASRGI 22
Query: 52 GTIYTA 57
Query: 128 GTYLAS 133
Query: 353 GGNVGY 358



>PS00342|PDOC00299|MICROBODIES_CTER Microbodies C-terminal targeting signal.
Query: 416 ARL 418
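
Unlike the other Prosite hits, this match is anchored to the C-terminus, so only the last three residues matter. A minimal sketch follows, assuming the tripeptide rule `[STAGCN]-[RKH]-[LIVMAFY]` at the C-terminal end. That rule is quoted from memory of the PROSITE entry and is an assumption here. `cterm_signal` is a hypothetical helper name.

```python
import re

# Assumed C-terminal tripeptide rule; \Z anchors the match at the end of the sequence.
CTER = re.compile(r"[STAGCN][RKH][LIVMAFY]\Z")


def cterm_signal(tail: str, total_length: int):
    """Return (start, tripeptide, end) in 1-based coordinates, or None.

    `tail` is any C-terminal stretch of the protein (stop symbol removed);
    `total_length` is the full protein length used for numbering.
    """
    m = CTER.search(tail)
    if m is None:
        return None
    return (total_length - 2, m.group(0), total_length)


# Last residues of the 21612 translation (row "EKLMNQMNARL*"), 418 aa in total.
print(cterm_signal("EKLMNQMNARL", total_length=418))
```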



FIGURE 18 




> Fbh21615a - Import - complete

1 ATGCAAAAGC CGAGNCCGCC TCGGCCTCTA AGCTATTCCA GAAGTAGTAA GAAGGCTTTT
61 TTGAAGGCCT AGGCTTTTGC AAAAAGCTCC TCGATCGAGG GGCTCGCATC TCTCCTTCAC
121 GGGGCCGCCG CCCTACCTGA GGCCGCCATC CACGCCGGTT GAGTCGCGTT CTGCCGCCTC
181 CCGCCTGTGG TGCCTCCTGA ACTGCGTCCG CCGTYTAGGT AAGTTTAAAG CTCAGGTCGA
241 GACCGGGCCT TTGTCCGGCG CTCCCTTGGA GCCTACCTAG ACTCAGCCGG CTCTCCACGC
301 TTTGCCTGAC CCTGCTTGCT CAACTCTACG TCTTTGTTTC GTTTTCTGTT CTGCGCCGTT
361 ACAGATCCAA GCTCTGAAAA ACCAGAAAGT TAACTGGTAA GTTTAGTCTT TTTGTCTTTT
421 ATTTCAGGTC CCGGATCCGG TGGTGGTGCA AATCAAAGAA CTGCTCCTCA GTGGATGTTG
481 CCTTTACTTC TAGGCCTGTA CGGAAGTGTT ACTTCTGCTC TAAAAGCTGC GGAATTCTAA
541 TACGACTCAC TATAGGGAGT CGACCCACGC GTCCGCAAAC CGAGTTCTGG AGAACGCCAT
601 CAGCTCGCTG CTTAAAATTA AACCACAGGT TCCATTATGG GTCGACTTGA TGGGAAAGTC
661 ATCATCCTGA CGGCCGCTGC TCAGGGGATT GGCCAAGCAG CTGCCTTAGC TTTTGCAAGA
721 GAAGGTGCCA AAGTCATAGC CACAGACATT AATGAGTCCA AACTTCAGGA ACTGGAAAAG
781 TACCCGGGTA TTCAAACTCG TGTCCTTGAT GTCACAAAGA AGAAACAAAT TGATCAGTTT
841 GCCAATGAAG TTGAGAGACT TGATGTTCTC TTTAATGTTG CTGGTTTTGT CCATCATGGA
901 ACTGTCCTGG ATTGTGAGGA GAAAGACTGG GACTTCTCGA TGAATCTCAA TGTGCGCAGC
961 ATGTACCTGA TGATCAAGGC ATTCCTTCCT AAAATGCTTG CTCAGAAATC TGGCAATATT
1021 ATCAACATGT CTTCTGTGGC TTCCAGCGTC AAAGGAGTTG TGAACAGATG TGTGTACAGC
1081 ACAACCAAGG CAGCCGTGAT TGGCCTCACA AAATCTGTGG CTGCAGATTT CATCCAGCAG
1141 GGCATCAGGT GCAACTGTGT GTGCCCAGGA ACAGTTGATA CGCCATCTCT ACAAGAAAGA
1201 ATACAAGCCA GAGGAAATCC TGAAGAGGCA CGGAATGATT TCCTGAAGAG ACAAAAGACG
1261 GGAAGATTCG CAACTGCAGA AGAAATAGCC ATGCTCTGCG TGTATTTGGC TTCTGATGAA
1321 TCTGCTTATG TAACTGGTAA CCCTGTCATC ATTGATGGAG GCTC^AGGTT GTGATTTTAG
1381 GATCTCCATG GTGGGAAGGA AGGCAGGCCC TTCCTATCCA CAGTGAACCT GGTTACGAAG
1441 AAAACTCACC AATCATCTCC TTCCTGTTAA TCACATGTTA AfTGAAAATAA GCTCTTTTTA
1501 ATGATGTCAC TGTTTGCAAG AGTCTGATTC TTTAAGTATA TTAATCTCTT TGTAATCTCT
1561 TCTGAAATCA TTGT7VAAGAA ATAAAAATAT TGAACTCAAA AAAAAAAAAA AAAAAAGGGC
1621 GGCCGCTAGA CTAGTCTAGA GAAAAAACCT CCCACACCTC CCCCTGAACC TGAAACATAA
1681 AATGAATGCM ATTGTTGKTG GTAACTTGTT ATTGCA
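
A listing of this kind (a start position followed by blocks of ten bases, sixty per row) can be joined back into one sequence while verifying that every row starts where its position label says it should. This is a minimal sketch under that layout assumption. `parse_numbered_blocks` is a hypothetical helper, not a tool referenced by this document.

```python
def parse_numbered_blocks(text: str) -> str:
    """Join 'position block block ...' rows into one sequence.

    Raises ValueError when a row's position label disagrees with the number
    of bases read so far, which flags dropped or garbled blocks.
    """
    chunks = []
    length = 0
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue  # skip blank lines between rows
        position, blocks = int(fields[0]), fields[1:]
        if position != length + 1:
            raise ValueError(f"row labelled {position} but {length} bases read so far")
        chunks.extend(blocks)
        length += sum(len(b) for b in blocks)
    return "".join(chunks)


# First two rows of the Fbh21615a listing.
listing = """
1 ATGCAAAAGC CGAGNCCGCC TCGGCCTCTA AGCTATTCCA GAAGTAGTAA GAAGGCTTTT
61 TTGAAGGCCT AGGCTTTTGC AAAAAGCTCC TCGATCGAGG GGCTCGCATC TCTCCTTCAC
"""
sequence = parse_numbered_blocks(listing)
print(len(sequence), sequence[:10])
```

Rows containing a block that is not exactly ten characters long make the following row fail the position check, so scanning damage surfaces as an error instead of a silent frame shift.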







FIGURE 19A 



Fbh21615a - Import - complete

MGRLDGKV IILTAAAQGI GQAAALAFAR EGAKVIATDI NESKLQELEK YPGIQTRVLD
VTKKKQIDQF ANEVERLDVL FNVAGFVHHG TVLDCEEKDW DFSMNLNVRS MYLMIKAFLP
KMLAQKSGNI INMSSVASSV KGVVNRCVYS TTKAAVIGLT KSVAADFIQQ GIRCNCVCPG
TVDTPSLQER IQARGNPEEA RNDFLKRQKT GRFATAEEIA MLCVYLASDE SAYVTGNPVI
IDGGWSL*



FIGURE 19B 



21615prot 
Alpha, Regions - Garnier-Robson
Alpha, Regions - Chou-Fasman
Beta, Regions - Garnier-Robson
Beta, Regions - Chou-Fasman
Turn, Regions - Garnier-Robson
Turn, Regions - Chou-Fasman
Coil, Regions - Garnier-Robson
Hydrophilicity Plot - Kyte-Doolittle
Alpha, Amphipathic Regions - Eisenberg
Beta, Amphipathic Regions - Eisenberg
Flexible Regions - Karplus-Schulz
Antigenic Index - Jameson-Wolf
Surface Probability Plot - Emini

FIGURE 20
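
Of the tracks listed for this figure, the Kyte-Doolittle plot is the simplest to reproduce: it is a sliding-window average of per-residue scale values. The sketch below uses the published Kyte-Doolittle hydropathy scale. The window length of 7 and the helper name `hydropathy_profile` are assumptions, and the plotting software may negate the values to display hydrophilicity.

```python
# Kyte-Doolittle hydropathy scale (positive values are hydrophobic).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq: str, window: int = 7) -> list[float]:
    """Mean Kyte-Doolittle score for every full window along the sequence."""
    scores = [KD[residue] for residue in seq]
    return [sum(scores[i:i + window]) / window
            for i in range(len(scores) - window + 1)]


# Two blocks from the Fbh21615a protein listing in FIGURE 19B.
profile = hydropathy_profile("IILTAAAQGIGQAAALAFAR")
print([round(value, 2) for value in profile])
```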



Analysis of 21615 (245 aa)

[Domain diagram omitted; track labels: PFAM, ProDom; domain labels: adh_short, adh_short_C2]
Signal Peptide Predictions for 21615

Method | Predict | Score | Mat@
SignalP (eukaryote) | NO | |

Note: amino-terminal 70aa used for signal peptide prediction



Transmembrane Segments Predicted by MEMSAT 



Start | End | Orient | Score
27 ins-->out

FIGURE 21



Prosite Pattern Matches for 21615

>PS00001|PDOC00001|ASN_GLYCOSYLATION N-glycosylation site.
Query: 39 NESK 42
Query: 130 NMSS 133

>PS00005|PDOC00005|PKC_PHOSPHO_SITE Protein kinase C phosphorylation site.
Query: 60 TKK 62
Query: 137 SVK 139
Query: 149 TTK 151
Query: 208 TGR 210

>PS00006|PDOC00006|CK2_PHOSPHO_SITE Casein kinase II phosphorylation site.
Query: 89 TVLD 92
Query: 184 SLQE 187
Query: 213 TAEE 216

>PS00007|PDOC00007|TYR_PHOSPHO_SITE Tyrosine kinase phosphorylation site.
Query: 42 KLQELEKY 49

>PS00008|PDOC00008|MYRISTYL N-myristoylation site.
Query: 17 GIGQAA 22
Query: 126 GNIINM 131
Query: 156 GLTKSV 161
Query: 169 GIRCNC 174

>PS00061|PDOC00060|ADH_SHORT Short-chain alcohol dehydrogenase family signature.
Query: 147 YSTTKAAVIGL 157

FIGURE 22